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SYSTEM FOR AUTOMATIC GENERATION OF ARBITRARILY INDEXED 
HYPERLINKED TEXT 

FIELD OF THE INVENTION 
[0001] The present invention relates generally to computer system text processing, 
and more particularly, to a system for automatic generation of arbitrarily indexed 
hyperlinked text. 

BACKGROUND OF THE INVENTION 

Statement Of The Problem 

[0002] A "hyperlink" is defined as a mechanism implemented via a computer system 
which allows a viewer to link (or jump) from a source location in a document to a destination 
location in the same or different document. The hyperlink is the combination of a display 
element or an indication (generally visual) that a hyperlink is available for a particular 
hyperlink source, and a computer program (typically a web browser) which finds and 
displays the hyperlink destination. A hyperlink thus provides a computer-assisted way for a 
human user to efficiently move between various locations containing information which is 
somehow related. 

[0003] A significant problem with generating information for computer-based 
hyperlink systems is that an author (system developer) must review the material to be 
hyperlinked, identify topics to which to hyperlink, and must also manually set up the 
hyperlinks. This is a time-consuming and labor-intensive process. Some of the disadvantages 
of manually creating arbitrarily indexed hyperlinked documentation for include: 

[0004] — Since commands can be indexed in arbitrary groups (not necessarily 
alphabetically), hand editing is required. This is not desirable, since manual entry of the 
necessary hyperlinking tags frequently results in mistyped links, rendering the 
documentation useless. 

[0005] — Since care must be taken to ensure that hand-generated links are accurate, 
the process of creating documentation manually is very slow and tedious. Therefore, 
documentation may be out-of-date and no longer relevant. 

[0006] Both of the above problems are exacerbated when there are hundreds of 
commands to hyperlink to each other, and also when the documentation is frequently 



updated. In addition, after the documentation is updated, it must be updated on the web 
server. This may be a problem if the person maintaining the documentation is not a web site 
administrator, and thus must wait until an administrator is available to make the necessary 
updates. 

[0007] Furthermore, every time a change is made to a document of interest, the author 
must manually update links to the changed documentation. 

[0008] What is needed is a system that automatically generates hyperlinks from 
indexed text in one document to one or more related topics in another document. There is 
also a need for a system that automatically updates the hyperlinks generated thereby to 
Internet-accessible web pages. 

Solution To The Problem 

[0009] The present system solves the above problems and achieves an advance on the 
field by providing a system that indexes text according to an arbitrary (i.e., not necessarily 
alphabetical) set of indices and automatically generates hyperlinks from each index to a 
related topic in a document. By automating the document linking and updating process, the 
chances of introducing errors into the hyperlinking process are greatly reduced, if not 
eliminated. Documentation is kept up-to-date and relevant. Furthermore, the documentation 
is instantly available over the Internet, since the present invention automatically copies all of 
the necessary files to the appropriate web server. In addition, it is no longer necessary to wait 
for the web server administrator to make updates to new or modified documentation. 

[0010] A text file containing the text of the documentation of interest is used as input 
to a hyperlink processing program that generates a file containing links to all of the topics of 
interest in the documentation. In one embodiment of the present system, the hyperlink 
processing program performs the steps set forth below. 

[0011] A category file is created that indicates an association between each of the 
topics and corresponding subjects included in each topic; 

a data structure is generated that associates each topic with corresponding 
subject names in accordance with the association defined in the category file; 

a plurality of subject name files is generated, each including HTML (hypertext 
markup language) text corresponding to an associated subject; 

a global index file comprising said hyperlinks to each of the subject name files 
is generated; 
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a plurality of index files is generated, each including hyperlinks between each 
of the topics and corresponding subject name files in accordance with the data 
structure; 

a list of category names corresponding to a primary set of indices is displayed 
in a first window, using hyperlinks in the index files; 

a secondary set of said indices comprising a list of subject names is displayed in 
a second window, in response to a user selecting one of the category names; and 

in response to a user clicking on one of the subject names in the second 
window, the text in the subject name file corresponding to the subject name selected 
by the user is displayed in a third window, using hyperlinks in the global index file. 

[0012] The present system also automatically updates the associated hyperlinks to 
other information located on the Internet-accessible web pages. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0013] Figure 1 illustrates a web page showing the use of hyperlinks in an exemplary 
embodiment; 

[0014] Figure 2 is a diagram showing the data organization in one embodiment of the 
present system; and 

[0015] Figures 3, 4A, and 4B are flowcharts illustrating an exemplary sequence of 
steps performed in carrying out one embodiment of the present system. 

DETAILED DESCRIPTION 
[0016] Figure 1 illustrates a web page screen display 100 showing the use of 
hyperlinks in an exemplary embodiment of the present invention. A hyperlink is an active 
area ('hot area') in a displayed document that when 'clicked' on (e.g., by a mouse), causes 
the user's browser to load an entity referenced by a URL (universal resource locator, which 
specifies the Internet address of the referenced item ). 

[0017] An anchor is an area within the content of a hypertext node (e.g. a web page) 
which is the source or destination of a hyperlink. A source anchor may be a word, phrase, 
image, or possibly the whole node. A destination anchor may be a whole node or some 
position within the node. Typically, clicking with a mouse on a source anchor causes the link 
to be followed (by the user's web browser) and the anchor at the destination end of the link to 
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be displayed. A destination anchor is simply an anchor placed in a file at a hyperlink 
destination. 

[0018] Anchors are typically highlighted in some way (either always, or when the 
mouse is over them), or they may be marked by a special symbol. In HTML (HyperText 
Markup Language), anchors are created with the <A..>..</A> construct. The opening A tag 
of a source anchor has an HREF (hypertext reference) attribute giving the destination in the 
form of a URL - usually a whole node or 'page'. Consider the following hyperlink 
definition: 

<a href =" next. html" target="window">click here</a> 
[0019] When a user clicks on the 'hot area' (source anchor ) represented by "click 
here", the document "next.html" will be loaded into the window or frame named "window." 
The anchor (<a>) tag is the HTML feature for defining both the source and the destination of 
the hyperlink. 

[0020] As shown in Figure 1, a hyperlink is conceptually represented by arrow 103 
between a hyperlink source 101(2), which is located in a first data file, and a hyperlink 
destination 104(1), which can be located in the same or in a second data file. As shown in 
Figure 1, text window 108 includes a plurality of (i.e., n) textual or graphical hyperlink 
source entities 101 (*) [where the '*' symbol indicates a wildcard representing any member of 
the class of items having the same basic reference number], indicated as Index (1) - Index(n). 
Each text entity (hereinafter referred to as an index) 101(*) has an associated HTML source 
anchor. For example, Index(2) [reference number 101(2)] has the associated anchor 102, 
which has Subindex(l) 104(1) as its destination in target window 109. The hyperlinked 
association between Index(2) and Subindex(l) is indicated by arrow 103. Alternatively, a 
source anchor in window 108, such as index(2) may have a destination anchor in a different 
window, as indicated by arrow 107 to destination ('target') text 111 in window 110. 

[0021] The present example is set in the context of a web page 100 that is used for 
displaying 'help' information about a system. For example, the items depicted by indices 
Index(l) - Index(n) [reference numbers 101(10 - 101(n)] may be categories of system 
commands, such as "script" commands or "run" commands, for which a user desires 
information. One of the text entities, e.g., Index(l), may be a category that includes all of the 
commands (or other entities) for which a target text entity 111 may be displayed. 
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[0022] When a user of the present system selects a particular command category, by 
clicking a mouse on the category name, a list of all of the commands in the category are 
displayed in a separate window. For example, if Index(2) in window 108 indicates "run 
commands", when the user selects (clicks on) the corresponding hot area 101(2) associated 
with Index(2), the text entities (sub-indices) Subindex(l), Subindex(2), and Subindex(3) in 
window 109 [shown by reference numbers 104(1) - 104(3)] will then be displayed as a list of 
members of the selected category of commands such as "continue", and "continue_one", and 
"step", respectively. 

[0023] If the user then desires to see the specific help information, e.g., for the "step" 
command, the hot area [Subindexl04(3)] for this command is selected (clicked on) in 
window 109, and the associated ('target') text 111 is displayed in main window 110 on the 
web page screen display 100. The mechanism by which these operations are performed is 
explained below in detail with reference to Figures 2, 3, 4A and 4B. 

[0024] Alternatively, the items depicted by text entities Index(l) - Index(n) may be 
topics in a document (instead of system commands), in which case the text entities 
Subindex(l) - Subindex(m) are related subtopics or subjects, i.e., names of members of the 
class denoted by the selected Index 101(*). In this case, the target text 111 displayed in 
target window 110 describes the selected subtopic or subject, rather than a system command. 

[0025] The HTML frame for the screen display 100 is generated by copying common 
files, for example, a .gif file for the basic web page frame graphics, a .ess file that describes 
the frame, and the index.html frame (the display screen html frame that points to the text for 
windows 108/109. These common files are stored in a known location, such as in the 
Command Name directory 212, described below with respect to step 435. 

[0026] Figure 2 is a block diagram illustrating basic data components in an exemplary 
embodiment of the present system. As shown in Figure 2, Help Text File 210 contains all of 
the help documentation, in plain text or ASCII format, for the system of interest. Operation 
of the present system is best understood by reference to Figure 2 in conjunction with the 
flowcharts of Figures 3, 4A, and 4B, as described below. Figures 3, 4A, and 4B are 
flowcharts illustrating an exemplary sequence of steps performed in carrying out one 
embodiment of the present system. As shown in Figures 2 and 3, at step 305, all of the help 
text documentation (or other documentation of interest) in a software program or other file 
(the 'initial documentation') 205 is loaded into Help Text File 210. 
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[0027] At step 310, Help Text File 210 is then processed by a hyperlink processor 
program, the details of which are explained below with respect to Figures 4A and 4B. Table 
1, below, shows an exemplary format for three of the entries in the Help Text File 210, which 
contains a number of entries equal to the number of commands or subjects which may be 
separately referenced (e.g., the number of text entities 111 which can be separately displayed 
in target window 110). Each entry in the file is separated by the delimiters "Command:" and 
"End:", and the delimiter "Help:" is employed to indicate the beginning of the help text for 
each command entry. These delimiters are generated and inserted when the Help Text file 
210 is generated. 

TABLE 1 Help Text File 

Command : 

breakpoint enable 

breakpoint enable <index> [<cnt>] 
Help : 

Enable breakpoint number <index>. If <index> is -1, enable all 
breakpoints. If <cnt> is provided, the breakpoint will be 
skipped the first <cnt> times it is hit. 

Use the command 'breakpoint list' for a list of all breakpoints. 
End: 



Command : 

create parisc 

create parisc <parisc> 

Help: 

Create a parisc template named <parisc>. <parisc> may then be 
used in the 'create dna ' command. See help for 'param parisc' 
for available parameters and default values 

End: 

Command : 
Step 

step [<count>] 
Help: 

Single step the current cpu for <count> instructions. 
If <count> is omitted, single step one instruction. 

See 'help continue' for a description of events that will cause 
an early termination of this command. 
End: 



[0028] As shown in Figure 4A, at step 410, Global Index file 214, containing a list of 
all of the commands (or other index words), is created by reading Help Text file 210 and 
inserting, in order, each of the command names found in file 210 into the Global Index file 
214. 

[0029] The HTML code for the hyperlinks used in the present system is generated at 
this step. The Help Text file 210 and the Global Index file 214 are generated using two input 
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files, the text documentation files in the initial document 205, and a manually generated Main 
Category file 215. From these two files, Global Index file 214, an Index Category file 230(*) 
for each category in the Main Category file 215, and a Command Name file 220(*) for each 
command in the Help Text file 210, are generated. These files are described in detail below. 
[0030] Exemplary contents of Global Index file 214 are shown below in Table 2. 

TABLE 2 Global Index File 

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 

Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd> 

<html> 

<head> 

<title>All Commands</title> 

<link REL=" STYLESHEET " TYPE="text/css" HREF=" . . /beta . css"> 
</HEAD> 
<body> 

<div class="f rametitle">All Commands 
<br> 

<a href="script. exit .html" target="helpFrame"> . exit</ a> 
<a href =" script . gosub . html " targe t="helpFrame ">. gosub</a> 
<a href ="script . goto . html" target="helpFrame "> . goto</a> 



<a href ="address. addr.html" target="helpFrame ">address addr </a> 
<a href ="address . encode . html" target="helpFrame">address encode 
</a> 

<a href ="address. rope. html" target="helpFrame">address rope </a> 



<a href="run. step.html" target="helpFrame">step </a> 



</body> 
</html> 



[0031] At step 412, a set of command (or subject) categories is determined, using 
arbitrary category names as desired. Step 412 may be performed at any time prior to creating 
the Main Category file in step 415, below. 

[0032] At step 415, a Main (command) Category file 215 is manually created. Each 
of the commands (or alternatively, subjects), for which help text is generated, is manually 
assigned to one or more named categories, and placed in the Main Category file accordingly. 

[0033] Each category is delimited by a percent sign in the first column of the line. 
The name of the category follows and has no restrictions. Each command that belongs in a 
particular category is listed on a separate line. Every command from the start of a particular 
category to the start of the next category is considered to be a command belonging to that 
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particular category. Commands may belong to any number of categories, as shown in Table 
3 below, wherein the "source" command is included in the "Script Commands" category, as 
well as in the category named "Test Category". 

[0034] An exemplary segment of Main Category file 215 is shown in Table 3, below. 

Table 3 Main Category File 

% Script Commands 

.exit 

. gosub 

. goto 

. if 

. ifdef 
. return 

source 
source_base 

% Run Commands 
step 

step_all 
step_over 
continue 
continue_one 

# The next category illustrates the fact that a command may # belong 
to more than one category 
% Test Category 
source 

step^all . 

[0035] At step 420, Main Category File 215 is read to determine the indices in which 
each command should be placed. At step 425, each command (or subject) in Main Category 
File 215 is associated with one or more indices, using the correspondence set forth in the file 
215. At step 430, each command (or subject) is inserted into Data Structure 216, wherein 
each command is associated with the appropriate indices as indicated by the command-to- 
category association of step 425. The data structure shown below in Table 4 is a Perl 
language hash table, wherein each hash has multiple values per key, and each key is an array. 
It should be noted, however, that any type of array mechanism could be employed to provide 
the desired association between each command and one or more relevant categories. 
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TABLE 4 Data Structure for Indexing Commands 



My $type = ""; 






My $ cmd = " " ; 






My $cmd type = ""; 






Open(TMP, $ categories ) ; 






# Read in the Main Cate 


jory file and store it 


into a global hash. 


# Commands are allowed to belong to more than 


one category, 


# which is why an array 






# (hashes with multiple 


values per key) 




while (<TMP>) { 






if(/~#/) { 






next; # the 


#' is a comment. Skip 


it . 


} elsif (/ A \s/) { 






next; # skip 


blank lines 




} elsif(/n/) { 






s/ A %\s+//; # trim 


hash mark and any # le 


ading whitespace 


chomp; # trim 


trailing whitespace 




$type = $ ; 






} else { 






chomp; 






push( @{$cmds{$ }}, 

} 

} 


$type) ; 




close (TMP) ; 







[0036] At step 433, text in Help Text File 210 is copied to an intermediate working 
directory, referred to as Command Name directory 212. Global index file 214, command 
name files 220(*) and index category files 230(*) (described below) are also copied to this 
directory. A segment of a subdirectory named "Help", containing HTML files for the 
individual command categories, as well as the command names, located in command name 
directory 212, is shown below. 

Run_Commands . nav . html 
Script_Commancls . nav. html 
address . addr . html 
address . asel . html 
address . decode . html 
address . encode . html 
address . nav. html 
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[0037] The Run_ Commands.nav.html file, for example, contains the following list of 

commands: 

continue 

continue_one 

step 

step_all 
step_over 

[0038] All of the files generated by the present system are located in the Command 
(subject) Name directory 212. At step 433, Index Category files 230(*) are present, but do 
not yet include any hyperlinks. Hyperlinks are added at step 470, described below. 

[0039] Figure 4B illustrates an exemplary instruction loop for parsing the Help Text 
file 210 to create Command (subject) Name files 220(1) -220(n) and Index files 230(1) - 
230(m). As shown in Figure 4B, at step 440, one or more lines of data is read from Help 
Text file 210. 

[0040] At step 445, if a beginning (command name) delimiter (e.g., "Command") is 
encountered in the data read from the Help Text file 210, then an Command Name file 220(*) 
is created for the term appearing in the line in the file 210 following the delimiter. Each 
Command Name file 220(*) contains HTML-encoded text for the associated command (or 
subject). To help illustrate the steps shown in Figure 4B, the "step" command shown in Table 
1 is used as an example: 

Command : 
step 

step [<count>] 
Help : 

Single step the current cpu for <count> 

instructions . . . 

End: 

[0041] In the above example, it can be seen that the line following the beginning 
delimiter "Command" contains the text "step". In accordance with the present system, a 
Command Name file 220(*) containing the HTML code for the present command is 
generated at step 460. In the present example, a Command Name file 220(*) named 
"step.html" containing the HTML code for the "step" command is generated. The HTML for 
step.html is shown below in Table 5. 
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TABLE 5 step.html 



< ! DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 

Transitional //EN" "http: //www. w3 . org/TR/REC-html 4 0 /loose . dtd> 
<html> 
<head> 
<title>step </title> 

<link REL= " STYLESHEET " TYPE=" text/ess " HREF=" . . /beta . css"> 
</HEAD> 
<body> 

<table width="100% "xtrxtd align="right "xdiv class = "ovlink"><a 
href =" . . / . . /ove 

rview.html">Overview</a></div></td></trx/table> 
<table> 
<tr> 

<tdximg src=" . . /archsimt . gif "></td> 

<td> 

<div class="helpversion">halfdome 2.0.17</div> 
<div class="f rametitle">generated Thu Jun 28 12:47:54 
2001</divx/td> 
</td> 
</tr> 
</table> 
<hr> 

<div class="f rametitle">step </div> 
<br> 

usage: <div class="usage">step [ &lt ; count &gt ; ] 

</div> 

<br> 

description : 

<pre class="desc"> 

Single step the current cpu for &lt ; count&gt ; instructions. 
If < count&gt ; is omitted, single step one Instruction. 

See 'help continue' for a description of events that will cause 

an early termination of this command. 

</pre> 

<hr> 

</body> 

</html> 



[0042] At step 465, a hyperlink is created from Global Index File 214 to the 
appropriate Command Name HTML file 220(*). For example, the hyperlink for the "step" 
command might appear as: 



<div class="f ramelink"><a hre f=" step . html " 
target="helpFrame">step </a></div> 

[0043] The above hyperlink links Index(n) [reference no. 101(n)] in window 108 with 
the corresponding Subindex(m) [reference no. 104(m)] in window 109. 

[0044] At step 470, hyperlinks are then created from the indices (categories) in data 
structure 216 to the associated Command Name files 220(*). As an example, the Index 
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(category) file containing hyperlinks from the Run Commands category to the associated 
commands in that category is shown below in Table 6. 

TABLE 6 

File name: Run_Commands.nav.html 

<IDOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 

Transitional / /EN" "http : / /www . w3 . org/TR/REC-html4 0 /loose . dtd> 

<html> 

<head> 

<title>Run Commands</t i tle> 

<link REL=" STYLESHEET" TYPE=" text/ess" HREF=" .. /beta . ess "> 
</HEAD> 
<body> 

<div class=" f rametitle">Run Commands</div> 
<br> 

<a href="continue .html" target="helpFrame ">cont inue </a> 

<a href =" continue^one . html " target="helpFrame">continue_one </a> 

<a href="step.html" target="helpFrame ">s tep </a> 

<a href="step__all .html" target="helpFrame">step_all </a>< 

<a href="step_over.html" target="helpFrame">step__over </a> 

</body> 

</html> . . . 

[0045] Note that, for example, the above hyperlink "step.html" links Subindex(m) 
[reference no. 104(m)] in window 109 with target window 110, to display corresponding help 
text 111 (i.e., the text display generated by step.html) therein. 

[0046] A branch back to step 435 is then taken, where a check is made to determine 
whether there is any remaining unread data in Help Text file 210. If all of the data in the file 
210 has been read, then the parsing of Help Text file 210 is completed, and processing 
resumes as shown in Figure 3, beginning at step 315 (described below). If more data remains 
in the Help Text file 210, then steps 440 through 445 are repeated, as described above. 

[0047] At step 445, if a beginning delimiter is not encountered, then a check is made 
for a "Help" delimiter, identifying the beginning of the help text, at step 450. If a Help 
delimiter is found, then, at step 455, the help text is copied from Help Text file 210 to the 
appropriate Command Name HTML file 220(*). Processing of Help Text file 210 then 
continues, at step 435, until the end of the file is encountered. If a help delimiter is not found, 
then, by default, the remaining data (line) in the file must be an "End" delimiter, and the 
Command Name HTML file 220(*) that was just generated is closed, at step 451. 

[0048] At step 315, the hyperlinked output generated in the above steps (shown in 
Figures 4A and 4B) is copied to the appropriate web server (not shown). Finally, at step 320, 
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Links on web page 100 are updated by the present system to point to the new (or modified) 
version of the help documentation created in accordance with the process described above. 

[0049] While exemplary embodiments of the present invention have been shown in 
the drawings and described above, it will be apparent to one skilled in the art that various 
embodiments of the present invention are possible. For example, the specific sequence of 
steps described above in Figures 1, 3, 4A and 4B, as well as the particular files and 
directories shown in Figure 2, should not be construed as limited to the specific embodiments 
described herein. Modification may be made to these and other specific elements of the 
invention without departing from its spirit and scope as expressed in the following claims. 
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